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RNA-RNA and RNA-protein interactions 
in coronavirus replication and transcription 
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Abbreviations: CoV, coronavirus; sgRNA, subgenomic RNA; UTR, untranslated region; TRS, transcription-regulating sequences; 
SL, stem loop; DI RNA, defective interfering RNA; ORF, open reading frame; BSL, bulged stem-loop; PK, pseudoknot; CS, 
conserved core sequence; AG, thermodynamic free energy; nsp, non-structural protein; RTC, replication-transcription complex; 

hnRNP, heterogeneous nuclear ribonucleoprotein; RNP complex, ribonucleoprotein complex 


Coronavirus (CoV) RNA synthesis includes the replication 
of the viral genome, and the transcription of sgRNAs by a 
discontinuous mechanism. Both processes are regulated by 
RNA sequences such as the 5' and 3' untranslated regions 
(UTRs), and the transcription regulating sequences (TRSs) 
of the leader (TRS-L) and those preceding each gene (TRS- 
Bs). These distant RNA regulatory sequences interact with 
each other directly and probably through protein-RNA and 
protein-protein interactions involving viral and cellular 
proteins. 1 By analogy to other plus-stranded RNA viruses, such 
as polioviruses, in which translation and replication switch 
involves a cellular factor (PCBP) and a viral protein (3CD), 2 it is 
conceivable that in CoVs the switch between replication and 
transcription is also associated with the binding of proteins 
that are specifically recruited by the replication or transcription 
complexes. Complexes between RNA motifs such as TRS-L and 
theTRS-Bs located along the CoV genome are probably formed 
previously to the transcription start, and most likely promote 
template-switch of the nascent minus RNA to the TRS-L 
region. 3 Many cellular proteins interacting with regulatory CoV 
RNA sequences 4 are members of the heterogeneous nuclear 
ribonucleoprotein (hnRNP) family of RNA-binding proteins, 
involved in mRNA processing and transport, which shuttle 
between the nucleus and the cytoplasm. In the context of CoV 
RNA synthesis, these cellular ribonucleoproteins might also 
participate in RNA-protein complexes to bring into physical 
proximity TRS-L and distant TRS-B, as proposed for CoV 
discontinuous transcription. 5 ' 7 In this review, we summarize 
RNA-RNA and RNA-protein interactions that represent modest 
examples of complex quaternary RNA-protein structures 
required for the fine-tuning of virus replication. Design of 
chemically defined replication and transcription systems will 
help to clarify the nature and activity of these structures. 
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Introduction 

The replication and transcription strategies of RNA viral 
genomes involve intramolecular interactions between physi¬ 
cally close and distal domains, or even intermolecular regulatory 
contacts. The long-range and local RNA-RNA contacts result 
in tertiary structures that modulate the function of enhancers, 
promoters and silencers. In addition to the direct interactions 
within the viral RNA, a wide spectrum of contacts between viral 
and cellular proteins has been identified that leads to a complex 
interaction network required for the fine-tuning of virus replica¬ 
tion. This network conforms a higher order RNA-protein struc¬ 
ture. Nowadays, limited technological procedures have studied 
compartmentalized aspects of these structures that represent 
frozen states in a dynamic equilibrium. A widely used strategy 
to approach RNA structure is the application of predictive algo¬ 
rithms that consider the viral RNA genomes free in solution. 810 
Although these theoretical predictions are useful to approach sci¬ 
entific questions, the results withdrawn are necessarily limited to 
small RNA domains. More refined studies predict RNA struc¬ 
ture as it is being synthesized, 11 but the real situation, the RNA in 
contact with viral and cellular proteins, is only addressed using 
reduced RNA motifs. 

RNA viruses are classified according to the nature of their 
RNA genomes into plus-strand, minus-strand and double- 
stranded RNA viruses. Of these three classes, the most abun¬ 
dant are the plus-stranded RNA viruses. The first replication 
step in these viruses is the translation of their genome that acts 
as a viral mRNA. Viral RNA replication takes place in replica¬ 
tion complexes generally associated to intracellular membranes. 
Replication and transcription of several plus-strand RNA viruses 
require RNA motifs frequently located within the non-coding 
domains of the 5' and 3' ends of the genome. Plus-stranded RNA 
viruses replicate their genome through a continuous process that 
starts with the synthesis of an intermediate full-length minus- 
strand RNA, which then serves as the template for the asymmet¬ 
ric synthesis of progeny plus-strand genomes. During replication, 
cis acting replication elements (CREs) residing within the cod¬ 
ing sequence, 1216 or genome cyclization may be needed. 17,18 The 
cyclization of RNA viral genomes may be mediated by RNA 
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viral and cell proteins are being summarized in 
this review, in comparison with similar long¬ 
distance and local RNA-RNA and RNA-protein 
interactions necessary for the replication of other 
RNA virus genomes. This review can be comple¬ 
mented with previously published ones. 4,6,32 ' 34 
Due to the limited space, we have only reported 
selected representative examples of the different 
types of interactions described in plus-strand 
RNA viruses. 

Cis-acting RNA Motifs Involved 
in CoV RNA Synthesis 


Figure 1 . CoV genome structure and strategy of sgRNA expression. The upper line 
represents the TGEV genome structure. Letters above the bar indicate the gene names. L, 
leader sequence. AAA, poly(A). The shorter lines represent the viral mRNAs. Letters to the 
right of these lines indicate the gene encoded by each mRNA. Black lines represent the 
positive stranded RNAs, whereas gray lines represent the negative stranded RNAs. The 
triangles indicate the positions of the conserved core sequences (CSs) located upstream 
of each gene. 


interactions and also by RNA-protein and protein-protein inter¬ 
actions. Interestingly, long-range and local RNA-RNA interac¬ 
tions may be mutually exclusive, and may function as molecular 
switches controlling critical aspects, such as changes between 
replication, transcription and translation. 19 ' 22 

Nidoviruses include the Coronaviridae family with two sub¬ 
families, Coronavirinae and Torovirinae. The Coronavirinae 
incorporates three genera: a, (3 and y; see talk.ictvonline.org/ 
media/g/vertebrate-2008/default, aspx for official coronavi- 
rus taxonomy. The CoVs have the largest known viral RNA 
genome, a plus-strand RNA of 27 to 31 kb. Both viral genome 
and sgRNAs share common 5' and 3' termini that could enable 
these molecules to amplify by a partially characterized mecha¬ 
nism (Fig. 1). CoV genome includes a 5' end CAP and a 3' end 
poly (A) that may also promote 5' to 3' interactions, mediated by 
viral and cellular proteins. These interactions may be involved 
in the timely switches controlling CoV replication and tran¬ 
scription. These activities include the synthesis of minus-strand 
RNAs associated to double-layered membranes (DMVs). 23-25 

Replication and transcription processes require the translation 
and processing of a large protein complex, encoded by the 20 kb 
replicase gene, into up to 16 non-structural proteins (nsps). 7 During 
nidovirus transcription, a 3' nested set of sgRNAs of increasing size 
is synthesized (Fig. 1). Each sgRNA contains a 5' leader segment 
connected to a body sequence that is identical to the 3' end of the 
genome. CoV transcription requires discontinuous RNA synthesis 
that involves highly specific long distance interactions promoting 
a template switch leading to a high frequency recombination. 26,27 
This process is guided by transcription regulating sequences 
located at the 3' end of the leader (TRS-L) and preceding the body 
(TRS-B) of each gene. Accumulating evidence supports a model 
in which the discontinuous step in sgRNA synthesis occurs during 
minus-strand RNA synthesis. 3,27-31 

CoV genome 5 - and 3' ends and internal structural motifs 
involved in virus genome replication, and their interaction with 


Many features used by CoVs for genome repli¬ 
cation and transcription have been established 
using natural and artificially constructed defec¬ 
tive interfering (DI) RNAs that conserve the 
cis-acting signals required for their replication 
in trans by a helper virus. 35-41 Recently, reverse 
genetic systems have been developed for several 
CoVs, allowing the study of CoV replication and 
transcription mechanisms in the context of the viral genome. 42-47 

The minimal sequences required for genome replication have 
been defined in the three CoV genera by DI RNA deletion map¬ 
ping analysis and, in all cases these sequences were located at 
both ends of the genome. For transmissible gastroenterities virus 
(TGEV), 35,48 from genus a, mouse hepetitis (MHV) from genus 
(3, and infectious bronchitis virus (IBV) from genus y, these 
sequences expanded the first 466 to 649 nt at the 5' end of the 
genome, and the 388 to 493 nt plus the poly (A) tail at the 3' end. 
These regions of the genome contain secondary and higher-order 
RNA structures, named cis-acting RNA elements that interact 
with RNA motifs or proteins, and are located within the highly 
structured 5' and 3' UTRs of the CoV genome although, some¬ 
times, they are also embedded into the adjacent coding sequences. 

5' cis-acting RNA elements. Higher-order RNA structures in 
the 5' end of the CoV genome were first studied in the bovine 
coronavirus (BCoV), where six stem-loop structures, denoted 
SL-I to SL-VI (Fig. 2), were identified and confirmed by enzy¬ 
matic probing and functional mutational analysis using DI 
RNAs. 38,53-57 The first four stem-loops mapped within the 5' 
UTR, while the SL-V and -VI mapped into the adjacent ORF la 
coding sequence. A consensus secondary structural model for the 
first 5' UTR 140 nt, based on the sequence of nine CoVs repre¬ 
senting the three genera, has been proposed. 58 In this model, the 
region comprising the original SL-I contains two stem-loops that 
within this review have been named SL-Ia and SL-Ib, in order to 
maintain the original nomenclature proposed in previous reports 
for the next stem-loops (Fig. 2). Recently, it has been predicted 
that stem-loops SL-Ia and SL-Ib are structurally conserved in 
all known CoVs, 59 and their structures have been confirmed by 
nuclear magnetic resonance (NMR) spectroscopy in MHV and 
human CoV (HCoV)-OC43. 60-62 A general feature of SL-Ia is 
that it adopts a bipartite structure and, at least in MHV, the upper 
region of the stem-loop needs to form a double stranded stem to 
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support viral replication. The lower portion of this stem loop has 
an optimized lability that probably mediates a long-range interac¬ 
tion between the 5' and 3' ends of the genome critical for sgRNA 
synthesis, but not required for genomic minus-strand synthesis. 60 
SL-Ib is the most conserved structure in the CoV 5' UTR. It is 
composed of a 5 bp stem and a highly conserved loop sequence 
that adopts a YNMG-type tetra-loop conformation, which has 
an important role in MHV sgRNA synthesis. 61,62 

The predicted SL-II in the BCoV (named SL-III in other 
studies), 58,59,62 which includes the leader core sequence (CS) into 
the terminal loop, has a low free energy and is poorly conserved 
among CoVs. 54,62 In fact, SL-II seems non-structured in other 
CoVs. 59,62 Downstream to the leader CS is located SL-III, an 
RNA motif structurally conserved in all CoVs. 59 The presence 
of a large number of co-variations supports the existence of this 
stem-loop and particularly the upper half (designated SL-III in 
the BCoV model), 55 which has been confirmed by enzyme struc¬ 
ture-probing and is essential for replication of BCoV DI RNA. 55 
A bulged stem-loop (BSL) SL-IV has been predicted in all 
CoVs. 59 This BSL contains the start codon of ORF la preceding 
the downstream arm of this stem. The existence of this higher- 
order structure was demonstrated in BCoV by RNase mapping 
and, similarly to SL-III, the structure rather than the primary 
sequence was important for DI RNA replication. 56 In addition, 
both SL-III and SL-IV are targets for the binding of viral and 
cellular proteins, which may play a role in the function of both 
stem-loops. 55,56 

Recent studies with BCoV DI RNAs have shown that the 
5'-terminal 186 nt of ORF la coding region were required for DI 
RNA replication. 53 This region contained two predicted stem- 
loops, named SL-V and -VI (Fig. 2), with structures confirmed 


by RNase probing and supported by nucleotide co-variation 
among closely related genus (3 CoVs. 53,57 In the same study, sev¬ 
eral cellular proteins binding both stem-loops in vitro were iden¬ 
tified, although the functional role of these interactions has not 
been proven. 

y cis-acting RNA elements. The cis-acting RNA elements 
of the CoV 3' UTR have been extensively studied in the genus (3 
CoVs MHV and BCoV, and three higher-order structures have 
been demonstrated by chemical and enzymatic probing and by 
genetic studies with DI RNAs and recombinant viruses (Fig. 2). 
The first structure is a BSL of around 68 nt, beginning immedi¬ 
ately downstream of the N gene stop codon, which is essential 
for MHV viral RNA replication. 63,64 Adjacent to the BSL there is 
a hairping-type pseudoknot (PK) of around 54 nt that overlaps 
with the BSL in 5 nt and is also required for viral RNA rep¬ 
lication. 22,65 Despite considerable primary sequence divergence, 
overlapping BSL and PK sequences are conserved in all genus 
(3 CoVs and are functionally equivalent. 64,66,67 However, genus a 
CoVs contain a highly conserved PK but not a detectable BSL, 65 
and CoVs from genus y have only a conserved and functionally 
essential BSL. 39 These data indicate that CoVs from genera a and 
y must have developed alternative mechanisms to those control¬ 
ling genus (3 CoV replication. 3'-Downstream of the PK there is a 
hypervariable region (HVR) that is highly divergent both in pri¬ 
mary sequence and secondary structure even among closely related 
CoVs like MHV and BCoV. However, the HVR contains an octa- 
nucleotide sequence, 5 -GGA AGA GG-3', that is universally con¬ 
served among CoVs. In the case of MHV, the HVR consists in a 
complex multiple stem-loop structure expanding the last 160 nt of 
the genome (Fig. 2). 68 Mutational analysis of MHV HVR indi¬ 
cated that this stem-loop was required for DI RNA replication. 68 
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Figure 3. Alternative RNA structures at the MHV 3' UTR. Two RNA motifs that probably exist in a dynamic equilibrium are shown. The switch between 
both structures has been postulated in the initiation of CoV RNA synthesis. 75 (A) 3' UTR structure in which the BSL is folded and the 3' end is annealed 
to the base of PK loop {LI). (B) 3' UTR conformation in which the 3' end is released and the lower stem of BSL is disrupted leading to the formation of 
the PK. BSL, bulged stem-loop; PK, pseudoknot; 57 and LI, stem 1 and loop 1 of the pseudoknot; HVR, hypervariable region. Acronyms as in Figure 2. 


However, further genetic analysis in the context of the fulhlength 
genome showed that the deletion of the entire HVR, including 
the octanucleotide, had only a modest effect on genomic RNA 
replication in cell culture, although an effect in pathogenesis was 
observed. 69 This discrepancy could be explained by the competi¬ 
tive nature of DI replication that could increase the effect of muta¬ 
tions that only have a moderate impact in the context of the intact 
genome. Finally, the 3' poly(A) tail acts as a cis-replication signal 
for both MHV and BCoV DI RNAs. 70 In this study, a correlation 
between DI RNA replication and the ability to bind poly (A)-bind¬ 
ing protein (PABP) is reported. 

In other plus-strand RNA viruses, similarly to CoVs, evidence 
for higher-order structural elements in the 5' and 3' UTRs, or 
even in coding sequences far away from genome ends have been 
reported. These structural elements are involved in RNA replica¬ 
tion, transcription, translation and encapsidation and have been 
recently reviewed in reference 32. 

A new kind of RNA secondary and higher order structures, 
named genome-scale ordered structures (GORS), has also been 
identified in the coding sequences of picornavirus, calicivirus and 
flavivirus genomes. These RNA structures were not associated 
with translation or replication, but were correlated with the abil¬ 
ity of these viruses to persist in their natural host. 71,72 

Short-Distance RNA-RNA Interactions 
in CoV RNA Synthesis 

The cis-acting elements in viral RNA genomes have been con¬ 
ventionally viewed as static structures. However, recent research 
has changed this point of view providing evidence for coopera¬ 
tive activity involving short and long-distance RNA-RNA inter¬ 
actions, as well as conformational changes of these structures to 
control viral replication and transcription. 73,74 


Short-distance RNA-RNA interactions and RNA confor¬ 
mational changes have been reported at the 3' UTR of MHV. 
In addition to the PK described above, a predicted interaction 
between PK loop 1 (LI) and the 3' end of the genome has been 
reported (Fig. 3). 75 In the same study, evidence of an interaction 
of the replicase products nsp8 and nsp9 with the PK is presented. 
With this information, a model for the initiation of CoV minus- 
strand RNA synthesis has been proposed. According to this 
model, in the initial structure of the 3' UTR, the BSL is folded 
and the 3' end of the genome is annealed to the base of PK loop 
(LI) (Fig. 3A). The formed stem-loop binds to a protein complex 
that includes nsp8 and nsp9 causing a conformational shift that 
releases the 3' end of the genome and disrupts the lower stem of 
the BSL, leading to the formation of the PK (Fig. 3B). The new 
conformation is probably recognized by the viral RdRp and asso¬ 
ciated factors, promoting minus-strand RNA synthesis. The two 
postulated structures cannot simultaneously fold up, leading to 
the proposal that the BSL and the PK are components of a molec¬ 
ular switch that regulates different steps of RNA synthesis. 22,63 

A molecular switch has also been described in the equine 
arteritis virus, the arterivirus prototype. 76,77 A PK interaction 
between the arterivirus loop SL-5, located at the 3' UTR and the 
hairpin of SL-4, located just upstream of SL-5, at the 3' terminus 
of the N gene, are essential for viral RNA synthesis. Furthermore, 
it has been proposed that the formation of this PK may constitute 
a molecular switch that could regulate the specificity and tim¬ 
ing of viral RNA synthesis. Similar PK interactions near the 3' 
end have been predicted for all known arteriviruses. 77 In the por¬ 
cine reproductive and respiratory syndrome virus (PRRSV), an 
additional kissing loop between SL-4 and an upstream hairpin 
located within the N gene is required for replication. 78 However, 
a similar kissing loop interaction has not been predicted in other 
arteriviruses. Similar short-distance RNA-RNA interactions and 
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Figure 4. Postulated template switch during CoV transcription. Nascent minus RNA strand (gray) is synthesized by the RdRp using genomic RNA 
(black) as a template. After template switch (indicated by the dashed arrow), nascent minus RNA strand hybridizes with the TRS-L, to resume synthesis 
of the subgenomic RNA. Different viral and cellular proteins may be involved in this process. N protein is required for an efficient transcription, and 
may help template switch by acting as an RNA chaperone. 


RNA conformational changes required for RNA synthesis have 
been observed in other RNA virus families as indicated above. 

Long-Distance RNA-RNA Interactions Involved 

in CoV RNA Synthesis 

Long-distance RNA-RNA interactions mediate CoV RNA syn¬ 
thesis. CoV discontinuous transcription process implies a pre¬ 
mature termination during the synthesis of the minus stranded 
RNAs and a template switch of the nascent minus RNA to the 
leader region. This switch requires long distance RNA-RNA 
interactions, probably assisted by RNA-protein complexes that 
would bring in close proximity the 5' end TRS-L and the TRS-B 
located 5' upstream of each gene (Fig. 4). These complexes, 
formed previously to template switch, might contribute to the 
stop of the minus RNA synthesis at TRS-B sequences. 3,5,31 

After this template switch takes place, a copy of the leader 
sequence is added to the nascent RNA by a discontinuous synthe¬ 
sis step. 29,79,80 The requirement of base pairing between the newly 
synthesized minus RNA and the TRS-L to facilitate transcrip¬ 
tion has been described in arteriviruses 27,30 and CoVs. 3,27,31,81,82 
The TRSs include a conserved CS and 5' and 3' variable flank¬ 
ing sequences. The template switch takes place after copying the 
CS sequence. A three RNA strand intermediate could be formed 
before template switching takes place, including the RNA tem¬ 
plate (TRS-B), the nascent negative stranded RNA (cTRS-B) 
and the leader region (TRS-L). If the AG associated to the forma¬ 
tion of TRS-L and cTRS-B duplex is above a minimum thresh¬ 
old, the template switch may take place. 3,31 

CoV transcription resembles a similarity assisted copy-choice 
RNA recombination mechanism, 26,27 being the TRS-L the accep¬ 
tor RNA and cTRS-B the donor RNA. This recombination 
mechanism requires sequence homology between the donor and 
acceptor RNAs and a hairpin structure present in the acceptor 
RNA. 83-85 Recently, it has been reported that a template switch 
may also take place within CoV sgRNAs in the absence of the 
standard TRS-L structure, an innovative result that may require 


additional assessment. 86 In torovirus, the discontinuous synthe¬ 
sis is also required for sgRNA2 synthesis. In this case, sequence 
similarity and the presence of a hairpin structure are needed. 87 

During TGEV transcription, a good correlation has been 
observed between the sgRNA levels and the AG values of the 
duplexes between TRS-L and each cTRS-B, with only one excep¬ 
tion in the synthesis of N gene sgRNA. This sgRNA was the most 
abundant, despite its lower AG value. 88 This exception led to the 
discovery in our laboratory of a cis-acting element upstream of N 
gene TRS-B that specifically regulated N gene transcription by a 
long-distance RNA-RNA interaction (Fig. 5). 88 The complemen¬ 
tarity between the proximal element (pE) located 6 nt upstream 
of CS-N and the distal element (dE), located 449 nt upstream of 
TRS-N core sequence (CS-N), was essential to increase the level of 
sgRNA-N. Interestingly, the AG value of this interaction was more 
relevant than the pE and dE primary sequences. This novel tran¬ 
scription regulatory mechanism is present in the CoV genus a, in 
which the N gene is not the most 3' one at the end of viral genome. 
This RNA-RNA interaction most likely promotes an increase in 
the transcription complex stop at the TRS-B located preceding gene 
N, during the synthesis of the minus-strand RNA (Fig. 5). This 
interruption would specifically enhance the frequency of the tem¬ 
plate switch to the leader during sgRNA N synthesis. This obser¬ 
vation is reminiscent of that previously reported by White’s group 
for transcription in tombusviruses by a premature termination 
mechanism. 33,74,89 A similar long-distance RNA-RNA interaction 
between two genomic RNAs promoting transcription was previ¬ 
ously described in red clover necrotic mosaic virus (RCNMV). 90 
In flock house virus (FHV) RNA production is also regulated by 
an intramolecular RNA-RNA interaction. 91 

Long-distance RNA-RNA interactions between RNA 
sequence motifs mapping at distal positions of the genome (800 
nt or more) that regulate replication and transcription, have also 
been described in other RNA viruses. In fact, potato virus X 
replication requires the involvement of RNA motifs, such us a 
hexa-nucleotide at the 3' end of the genome and a central con¬ 
served sequence, into different long distance interactions. 92,93 In 


www.landesbioscience.com 


RNA Biology 


241 





































Downloaded by [120.198.125.74] at 16:42 02 December 2015 



CS-L 


dE 


gRNA 5’ 


pE CS-N 


432nt 


3’ 


5’(+) 

3’(-) 


CS-L 


c 


dE 


38888888 


) 


CS-N 


38888888 


PE 



cTRS-N 




3’(+) 

5’(-) 


Figure 5. Working model for the specific regulation of TGEV sgRNA-N transcription. The upper line 
represents the TGEV genome (not to scale) including the core sequence (CS-L) located at the 3' 
end of the leader, the distal and proximal elements (dE and pE, respectively) and the CS of N gene 
(CS-N). The number below the arrow represents the distance between dE and pE elements. The 
lower lines represent the long distance RNA-RNA interaction within the genome that may act as a 
stop signal during the synthesis of the negative stranded sgRNA-N (gray line). The dashed arrow 
indicates the template switch step. cTRS-N, sequence complementary to the CS-N and the 3' TRS-N 
flanking sequences. 


addition, plus-strand genomic and sgRNA synthesis also require 
interactions between an octa-nucleotide located at the genome 
5' end, and the central conserved sequences indicated above. 94 
In hepatitis C virus (HCV), genome replication involves RNA- 
RNA interactions between a cis-acting element located within 
the coding sequences and other structural motifs inserted up and 
downstream to this element. 12,19 

Genome cyclization is the most distal intramolecular interac¬ 
tion possible within two domains in a viral genome. This type of 
contact has been described in several RNA virus systems, includ¬ 
ing CoVs, as a prior step to genome replication and transcrip¬ 
tion. Previous studies on MHV DI RNA replication, indicate 
that the last 55 nt of the 3' end plus the poly(A) tail are sufficient 
to promote the synthesis of the negative-strand, while both the 
5' and 3' ends of the genome are required for plus-strand RNA 
synthesis. 95 This observation led us to postulate an interaction 
between the 5' and 3' ends of the genome for CoV replication. 
A direct RNA-RNA interaction between genome ends has been 
predicted for MHV, TGEV and severe and acute respiratory syn¬ 
drome CoV (SARS-CoV) using computer programs designed to 
analyze full-length genomes. 96,97 Genetic evidence supporting 
a 5-3' end cross-talk that stimulates sgRNA synthesis 60 came 
from a study on MHV, in which it was suggested that the base of 
stem loop SL-Ia, located at the 5' UTR (Fig. 2), mediates a long 
range interaction with the 3' end. In addition, experimental evi¬ 
dence reported for MHV 98 and TGEV (Almazan F, Galan C and 
Enjuanes L, unpublished results) suggests that the interaction 
between 5' and 3' ends may also be mediated by RNA-protein 
interactions as described below. 

Genome cyclization mediated by RNA-RNA, RNA-protein 
and protein-protein interactions has also been described in other 
positive stranded RNA viruses. In poliovirus genome circulariza¬ 
tion the PABP binds to the 3' poly (A) tail and interacts with the 
protease-polymerase precursor (3CD) bound to the 5' cloverleaf 


RNA. 99 This circularized ribonucleo- 
protein is required to initiate the syn¬ 
thesis of the negative stranded RNA. 100 
In contrast, the binding of the cellular 
poly(rC) binding protein (PCBP) to the 
5' cloverleaf RNA promotes the trans¬ 
lation. 101 Similarly, in pestivirus such 
as bovine viral diarrhea virus (BVDV), 
genome circularization is also mediated 
by three proteins of the NFAR group. 
This circularization promotes the switch 
from translation to replication. 102 Foot 
and mouth disease virus (FMDV) uses 
specific higher-order RNA structures 
located at the termini of viral genome 
to regulate the translation and viral 
RNA synthesis. The replication process 
requires direct 5' to 3' end RNA-RNA 
interactions mediated by the S region 
preceding the IRES, the IRES 5' end 
and the 3' end of the genome. 103 In addi¬ 
tion, RNA-protein interactions also play 
an important role connecting both ends. 

In flaviviruses, genome circularization is mediated by 
direct long-distance RNA-RNA interactions between comple¬ 
mentary sequence motifs present at both ends of the genome. 
Complementarity disruption impairs RNA synthesis, providing 
evidence for the requirement of circularization during the repli¬ 
cation process. 17,20,104 

The long-distance RNA-RNA interactions described until 
now probably represent the tip of the iceberg of a complex 
network involving complete RNA virus genomes. These long¬ 
distance interactions probably lead to a tertiary structure of the 
full-length virus genome. The best paradigm of this theory is 
probably reflected in the pioneering studies on tomato bushy 
stunt virus (TBSV), for which a global RNA structure has been 
proposed to regulate replication, transcription and translation. 33,89 

RNA-Protein Interactions Involved 
in CoV RNA Synthesis 

The identification of viral proteins participating in coronavi- 
rus RNA synthesis was traditionally based on the analysis of 
temperature-sensitive mutant viruses. This approach led to the 
identification of several replicase-encoded nsps as key factors in 
replication and transcription. 1,105 Cell factors involved in RNA 
synthesis have also been identified by studying their binding to 
selected regions of the genome, 1,106 or to replicase proteins. 107,108 
Recently, high-throughput assays have been used for the iden¬ 
tification of viral and cellular factors affecting CoV replication. 
These technologies include genome-wide two-hybrid screen¬ 
ings 109,110 and proteomic analyses 111,112 that identify interactions 
between replicase components, probably relevant in CoV replica¬ 
tion. Alternatively, strategies similar to the ones applied to other 
plus-stranded RNA virus genomes may also be useful to study 
CoV replication. This is the case of high-throughput functional 
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assays using host cell mutants or small interfering RNAs (siR- 
NAs) affecting viral RNA amplification and expression applied 
to HCV, 113,114 and human immunodeficiency virus (HIV). 115,116 

One of the largest limitations in the identification of factors 
affecting replication or transcription is to distinguish between 
direct and indirect effects. In addition, factors affecting RNA syn¬ 
thesis frequently also affect translation. 117 Therefore, it becomes 
essential to study the role of factors in defined functional assays, 
such as in vitro replication or transcription systems. The purifica¬ 
tion of in vitro active CoV replication-transcription complexes 
(RTCs) has been described in reference 118. This system could 
be used to analyze the role of viral and cell factors in RNA syn¬ 
thesis. Although both viral and cellular proteins are most likely 
part of the purified active complexes, the precise components that 
reproduce the activity still need to be defined. An alternative is to 
develop in vitro systems based on defined components. The key 
difficulty for the development of these CoV in vitro systems is the 
requirement of a large number of components. 

Viral proteins involved in CoV RNA synthesis. Among the 
viral proteins required in CoV RTC, it has been proposed that 
the RdRp (nspl2), the helicase (Hel, nspl3) and N protein are 
essential. 5 In addition, other viral proteins probably contribute to 
the formation of this complex and the regulation of its activity. 
In fact, several replicase proteins such as nsp3, nsp4 and nsp6 
have mainly scaffolding and membrane rearrangement func¬ 
tions. 7,119 ' 121 In addition, nsp3 and nsp5 act as proteases for the 
correct processing of replicase polyproteins. 7 

Arterivirus nspl binds RNA and has been involved in tran¬ 
scription, 122,123 probably by modulating the relative abundance 
of subgenomic and genomic RNAs. 124 In CoVs, the binding of 
nspl to a BCoV cis-acting replication signal has been reported in 
reference 57. Nevertheless, nspl protein has not been involved in 
RNA synthesis. In line with this observation, the replicase of IBV, 
a member of CoV genus y, does not encode a nspl, 7,125 suggesting 
that CoV nspl is probably dispensable for CoV RNA synthesis. 

Nsp2 is recruited to the RTC. 126 Although nsp2 is dispens¬ 
able for genus (3 CoV replication, a 50% reduction in viral RNA 
synthesis is observed when this protein is deleted. 127 Therefore 
the role of nsp2 in CoV RNA synthesis still remains to be fully 
established. 

Nsp3 is a multifunctional protein that binds RNA. 111,128 The 
role of nsp3 in CoV RNA synthesis remains unclear, as most of its 
domains are accessory for RTC activity, although they may play 
a role in pathogenesis. 129,130 The interaction between nsp3 and 
N protein has been recently described, and it has been proposed 
that nsp3 is involved in the location of CoV RNA genome within 
the RTC complex at early stages of RNA synthesis. 131 SARS-CoV 
E protein binds the N-terminal acidic domain of nsp3, but its 
specific role in CoV replication remains to be stablished. 132 

It has been proposed that nsps 7 to 10 form a functional cas¬ 
sette involved in virus RNA synthesis. 133 Nsp7 and nsp8 form a 
hexadecameric complex that has structural properties compatible 
with that of a processivity factor for RdRp. 134 Nsp8 has RdRp 
activity and probably produces the primers required for nspl2 
mediated RNA synthesis. 135,136 Nsp9 is a single-stranded RNA 
binding protein that may stabilize viral RNAs during RNA 


synthesis and processing. 137,138 Nsp9 forms a homodimer and its 
oligomerization is required for viral replication. 139 MHV nsp8 
and nsp9 interact with RdRp, 140 and genetic evidence suggest 
that they also interact with the RNA PK at the 3' end of the viral 
genome. 75 NsplO contains two Zn-finger motifs, binds nucleic 
acids non-specifically, 141,142 and has been implicated in minus- 
strand RNA synthesis. 143 

Nsps 14, 15 and 16 encode RNA-processing activities that are 
unique to the RNA virus world, as exonuclease (ExoN), endoribo- 
nuclease (EndoU) and methyltransferase (MT) activities, respec¬ 
tively. 7 All these activities are required for efficient CoV RNA 
synthesis. 43,144 146 It has been proposed that the presence of these 
activities, especially nspl4, allows the maintenance of functional 
CoV genomes of large size by acting as a proofreading system. 125,147,148 
CoV nspl4 also encodes a MT activity at its C-terminus, required 
for RNA synthesis. 149 It is possible that MT activities of nspl4 and 
nspl6 act coordinately for viral mRNA capping. 150 

CoV N protein forms a ribonucleoprotein complex with 
genomic RNA. In addition to its structural role, this protein has 
also a prominent role in RNA synthesis. N protein colocalizes 
with the RTC. 151,152 Initial observations suggested that N protein 
plays a role in RNA synthesis 153 158 and additional studies con¬ 
firmed that this was the case. 159 " 161 These results are in agreement 
with the significant increase of infectious virus rescue from dif¬ 
ferent CoV cDNA clones by providing N protein in trans. 47,160,162 
The N-terminal domain of CoV N protein binds specifically to 
TRS-L sequences and facilitates unwinding of TRS duplexes. 163 
CoV N protein also has RNA chaperone activity 164 that enhances 
template switch in vitro. 165 As a consequence of these activities, it 
has been postulated that N protein may facilitate the CoV tem¬ 
plate switch step during CoV transcription. 163,165 In fact, it has 
been recently reported that N protein is required for an efficient 
transcription but is not essential for CoV RNA replication. 165 

Cellular proteins involved in CoV RNA synthesis. Cellular 
proteins associated with CoV RNA synthesis are reviewed below 
according to their binding to 5' UTR, internal TRS-Bs, 3' UTR 
or to viral replicase components. Protein binding to 5' or 3' UTR 
might be involved in viral replication and also in transcription, 
translation and viral RNA stability. Binding to internal TRS-Bs 
might be related to transcription or post-transcriptional steps. 

Cellular proteins binding CoV 5' UTR. In MHV, belonging 
to CoV genus (3, polypyrimidine-tract binding (PTB) protein, 
also known as hnRNP I, binds pentanucleotide repeats of leader 
TRS, located in the 5' UTR (nt 56—112). Deletion of these leader 
repeats in DI RNAs significantly inhibited RNA transcription, 
suggesting that PTB might be involved in transcription regula¬ 
tion. 166 Functional studies showed delayed virus production and 
reduction of viral RNA synthesis in MHV infected cells overex¬ 
pressing PTB. This protein interacts with viral N protein both 
in vitro and in vivo, suggesting a possible contribution to the 
ribonucleoprotein (RNP) complex formation. 167 PTB also binds 
to the TRS-L sequences at the 5' end of TGEV genome 106 (Sola 
I and Enjuanes L, unpublished results). hnRNP A1 specifi¬ 
cally binds to MHV RNA consensus sequences present in viral 
genome negative RNAs cTRS-L and cTRS-B, complementary 
to TRS-L and TRS-B RNA motifs, respectively. 168 Another 
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Table 1. Illustrative examples of cellular proteins binding CoVTRSs 
identified by RNA-affinity chromatography and mass spectrometry 

Biological process 3 
mRNA splicing 

Transcriptional regulation 
mRNA binding and processing 
Translational regulation 
Cytoskeleton-movement 
Metabolic processes 
a Gene ontology classification of biological processes. 

member of hnRNP family, SYNCRIP or hnRNP Q, binds to 
MHV 5' UTR or to its complementary sequence c5' UTR. 169 
Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was also 
identified in association with TGEV TRS-L (Table 1). In addi¬ 
tion, the knockdown of the GAPDH gene produced a positive 
effect on CoV-replication both in a TGEV replicon and full- 
length infectious virus systems, suggesting a potential role in the 
balance of cellular factors that positively influence TGEV RNA 
synthesis. 106 This hypothesis is supported by the formation of a 
complex including GAPDH, the bifunctional aminoacyl-tRNA 
synthetase (EPRS), hnRNP Q and the ribosomal protein LI3a, 
in an inflammatory response. 170 

Cellular proteins binding CoV TRS-B. hnRNP A1 binds to 
the complementary RNA strand of MHV TRS-L (cTRS-L) 
and of several TRS-Bs (cTRS-Bs). 168,171 Mutagenesis of TRS-B 
sequences in a DI RNA reporter system resulted in reduced tran¬ 
scriptional activities that correlated with relative binding affini¬ 
ties of the cTRS-B sequences to hnRNP Al, suggesting that this 
protein may be a transcription factor in CoV RNA synthesis. 168,171 
Nevertheless, it cannot be ruled out that the observed effect was 
due to changes in the extent of the complementarity between the 
newly synthesized cTRS-B and TRS-L. In an in vitro reconstitu¬ 
tion assay, hnRNP Al mediates the formation of an RNP com¬ 
plex with the minus strand cTRS-L and cTRS-B. 172 Furthermore, 
hnRNP Al interacted both in vitro and in vivo with MHV N 
protein, that forms part of the RTC. 173 

We have used TGEV CoV as a model to identify candidate 
cellular proteins participating in CoV transcription. Leader and 
body TRSs (TRS-L and TRS-B, respectively) with either posi¬ 
tive or negative polarity, as well as RNA duplexes mimicking 
TRS-L/cTRS-B hybrids formed during transcription template 
switch, were used in RNA affinity chromatography assays. 
Mass spectrometry analysis led to the identification of around 
40 cellular proteins interacting with TGEV TRSs. A major 
proportion of these proteins was associated with cellular RNA 
metabolism, including pre-mRNA processing (45%), transcrip¬ 
tion (8%) and translation (20%) (Fig. 6). Members of hnRNP 
family or the DEAD box family of cellular RNA helicases 
(DDX and DHX) were extensively represented. hnRNP I was 
found in association with the plus-stranded RNA of TRS-L and 
several TRS-B and, therefore, PTB may participate in the regu¬ 
lation of sgRNA synthesis (Sola I and Enjuanes L, unpublished 
results). Other identified proteins were related to cytoskel- 


eton (16.7%) or cell metabolism processes (3.3%) (Fig. 6 and 
Table 1). Some of the identified proteins have already been 
involved in the life cycles of other plus-stranded RNA viruses 
and may provide novel insights into the protein machinery uti¬ 
lized by CoVs in RNA synthesis. 174 

Cellular proteins binding CoV 3' UTR. UV-crosslinking exper¬ 
iments showed binding of hnRNP Al to MHV genome 3' UTR 
sites located at nt 90—170 (mapping within the HVR domain) 
and nt 260—350 numbered from the 3' end of the genome (par¬ 
tially overlaping with the BSL). 98 Overexpression of hnRNP Al 
or of a dominant-negative mutant derived from this protein, 
accelerated or delayed, respectively, viral RNA synthesis, suggest¬ 
ing that hnRNP Al participates in the formation of the RTC. 175 
hnRNP Al is translocated from the nucleus to the cytoplasm 
of infected cells, probably as a result of specific interaction with 
MHV RNA. 168 Other member of the hnRNP A2/B1 multigene 
family can substitute hnRNP Al in MHV replication. 176 hnRNP 
Al binding sites are complementary to those on the negative- 
strand RNA that bind PTB. Partial deletion of hnRNP Al and 
PTB binding sites abolished sgRNA synthesis by MHV DI 
RNAs. 177 hnRNP Al and PTB also bind to the complementary 
strands at the 5' end of MHV RNA and mediate the in vitro for¬ 
mation of a RNP complex involving the 5' and 3' end fragments 
of MHV RNA. 98 

Other cellular proteins bind the CoV 3' end genome and 
also affect virus replication. Thus, PABP binds to BCoV 70 and 
TGEV (Marquez-Jurado S, Enjuanes L and Almazan F, unpub¬ 
lished results) poly (A). Furthermore, a positive effect of PABP 
in TGEV RNA synthesis has been observed in reference 106. 
Other 3'-interacting host factors such as the hnRNP Q and the 
EPRS also positively contribute to TGEV RNA synthesis, as 
shown in functional analysis with siRNAs using both a TGEV 
derived replicon and the infectious virus itself. 106 Transcriptional 
coactivator plOO (SND1) specifically binds the 3' end of the 
TGEV, 106 and also the TRS-L and TRS-B RNA motifs (Table 
1) (Sola I and Enjuanes L, unpublished results). PlOO protein 
interacts with the nspl of EAV, specifically involved in mRNA 
synthesis. 178 However, no functional studies have shown the 
role of plOO in arterivirus transcription. Mitochondrial proteins 
aconitase, heat shock protein (HSP)40, HSP60 and HSP70 
bind to MHV 3'-most 42 nt of RNA genome, although their 
requirement for CoV replication was not directly addressed. 179,180 
Specific mutations in this 3' UTR sequence led to a virus with a 
defect in sgRNA synthesis, 181 supporting the relevance of this 3' 
end sequence domain in virus transcription. 

Cellular proteins binding CoV replicase subunits. An alterna¬ 
tive approach to identify host factors that participate in CoV 
RNA synthesis relies on their specific binding to viral replicase 
subunits. Several members of the cellular DEAD box family 
of helicases such as DDX5 and DDX1 have been associated 
with CoV RNA synthesis. These helicases have been involved 
in several biological processes related to RNA transcriptional 
regulation, pre-mRNA processing, RNA degradation, RNA 
export, ribosome assembly and translation. 182 The specific inter¬ 
action of cellular DDX5 protein with the SARS-CoV helicase 
protein (nspl3) has been shown using yeast and mammalian 


Protein name 

hnRNP Al, hnRNP A2/B1, hnRNP 
A3, hnRNP I, hnRNP Q 

ILF2, ILF3, SND1 

Nucleolin, DDX1,DHX15 

elFla, elF3S10 

Tubulin, annexin A2, moesin 
GAPDH 
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Figure 6. Cell proteins with affinity for TGEV RNA structural motifs. Functional classification of cellular proteins interacting with CoV TRSs. RNA-affinity 
chromatography assays were performed with TGEV-infected human Huh7 cell extracts and viral TRS RNAs. Forty proteins interacting with TRS RNAs 
were identified by mass spectrometry and categorized according to the biological processes in which they are involved. 


two-hybrid assays and co-immunoprecipitation. Silencing of 
DDX5 expression by siRNAs significantly reduced viral RNA 
replication and virus titers. 107 DDX1 interacted with IBV and 
SARS-CoV exonuclease (npl4), as demonstrated by yeast two- 
hybrid and coimmunoprecipitation assays. The effect of DDX1 
expression knockdown in CoV RNA replication and transcrip¬ 
tion indicated that DDX1 might be a cofactor essential for these 
activities. 108 Interestingly, RNA helicase DDX1 has also been 
identified in association with TGEV TRSs (Sola I and Enjuanes 
L, unpublished results) and might perform common functions 
in CoV replication. 

Perspectives 

Recent advances in understanding the molecular mechanisms of 
replication and transcription in positive-stranded RNA viruses 
indicate that regulatory elements in viral genomes are dynamic 
complexes involving short and long-distance RNA-RNA interac¬ 
tions and also RNA-protein complexes including viral and cel¬ 
lular proteins. Transition between two structural conformations 
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